101 research outputs found

    Investigation of associations between retinal microvascular parameters and albuminuria in UK Biobank: a cross-sectional case-control study

    Get PDF
    BACKGROUND: Associations between microvascular variation and chronic kidney disease (CKD) have been reported previously. Non-invasive retinal fundus imaging enables evaluation of the microvascular network and may offer insight to systemic risk associated with CKD. METHODS: Retinal microvascular parameters (fractal dimension [FD] - a measure of the complexity of the vascular network, tortuosity, and retinal arteriolar and venular calibre) were quantified from macula-centred fundus images using the Vessel Assessment and Measurement Platform for Images of the REtina (VAMPIRE) version 3.1 (VAMPIRE group, Universities of Dundee and Edinburgh, Scotland) and assessed for associations with renal damage in a case-control study nested within the multi-centre UK Biobank cohort study. Participants were designated cases or controls based on urinary albumin to creatinine ratio (ACR) thresholds. Participants with ACR ≥ 3 mg/mmol (ACR stages A2-A3) were characterised as cases, and those with an ACR < 3 mg/mmol (ACR stage A1) were categorised as controls. Participants were matched on age, sex and ethnic background. RESULTS: Lower FD (less extensive microvascular branching) was associated with a small increase in odds of albuminuria independent of blood pressure, diabetes and other potential confounding variables (odds ratio [OR] 1.18, 95% confidence interval [CI] 1.03-1.34 for arterioles and OR 1.24, CI 1.05-1.47 for venules). Measures of tortuosity or retinal arteriolar and venular calibre were not significantly associated with ACR. CONCLUSIONS: This study supports previously reported associations between retinal microvascular FD and other metabolic disturbances affecting the systemic vasculature. The association between retinal microvascular FD and albuminuria, independent of diabetes and blood pressure, may represent a useful indicator of systemic vascular damage associated with albuminuria

    The Dawn of Open Access to Phylogenetic Data

    Get PDF
    The scientific enterprise depends critically on the preservation of and open access to published data. This basic tenet applies acutely to phylogenies (estimates of evolutionary relationships among species). Increasingly, phylogenies are estimated from increasingly large, genome-scale datasets using increasingly complex statistical methods that require increasing levels of expertise and computational investment. Moreover, the resulting phylogenetic data provide an explicit historical perspective that critically informs research in a vast and growing number of scientific disciplines. One such use is the study of changes in rates of lineage diversification (speciation - extinction) through time. As part of a meta-analysis in this area, we sought to collect phylogenetic data (comprising nucleotide sequence alignment and tree files) from 217 studies published in 46 journals over a 13-year period. We document our attempts to procure those data (from online archives and by direct request to corresponding authors), and report results of analyses (using Bayesian logistic regression) to assess the impact of various factors on the success of our efforts. Overall, complete phylogenetic data for ~60% of these studies are effectively lost to science. Our study indicates that phylogenetic data are more likely to be deposited in online archives and/or shared upon request when: (1) the publishing journal has a strong data-sharing policy; (2) the publishing journal has a higher impact factor, and; (3) the data are requested from faculty rather than students. Although the situation appears dire, our analyses suggest that it is far from hopeless: recent initiatives by the scientific community -- including policy changes by journals and funding agencies -- are improving the state of affairs

    TOLKIN – Tree of Life Knowledge and Information Network: Filling a Gap for Collaborative Research in Biological Systematics

    Get PDF
    The development of biological informatics infrastructure capable of supporting growing data management and analysis environments is an increasing need within the systematics biology community. Although significant progress has been made in recent years on developing new algorithms and tools for analyzing and visualizing large phylogenetic data and trees, implementation of these resources is often carried out by bioinformatics experts, using one-off scripts. Therefore, a gap exists in providing data management support for a large set of non-technical users. The TOLKIN project (Tree of Life Knowledge and Information Network) addresses this need by supporting capabilities to manage, integrate, and provide public access to molecular, morphological, and biocollections data and research outcomes through a collaborative, web application. This data management framework allows aggregation and import of sequences, underlying documentation about their source, including vouchers, tissues, and DNA extraction. It combines features of LIMS and workflow environments by supporting management at the level of individual observations, sequences, and specimens, as well as assembly and versioning of data sets used in phylogenetic inference. As a web application, the system provides multi-user support that obviates current practices of sharing data sets as files or spreadsheets via email

    Genome-wide SNP identification by high-throughput sequencing and selective mapping allows sequence assembly positioning using a framework genetic linkage map

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Determining the position and order of contigs and scaffolds from a genome assembly within an organism's genome remains a technical challenge in a majority of sequencing projects. In order to exploit contemporary technologies for DNA sequencing, we developed a strategy for whole genome single nucleotide polymorphism sequencing allowing the positioning of sequence contigs onto a linkage map using the bin mapping method.</p> <p>Results</p> <p>The strategy was tested on a draft genome of the fungal pathogen <it>Venturia inaequalis</it>, the causal agent of apple scab, and further validated using sequence contigs derived from the diploid plant genome <it>Fragaria vesca</it>. Using our novel method we were able to anchor 70% and 92% of sequences assemblies for <it>V. inaequalis </it>and <it>F. vesca</it>, respectively, to genetic linkage maps.</p> <p>Conclusions</p> <p>We demonstrated the utility of this approach by accurately determining the bin map positions of the majority of the large sequence contigs from each genome sequence and validated our method by mapping single sequence repeat markers derived from sequence contigs on a full mapping population.</p

    Genome-Wide Distribution and Organization of Microsatellites in Plants: An Insight into Marker Development in Brachypodium

    Get PDF
    Plant genomes are complex and contain large amounts of repetitive DNA including microsatellites that are distributed across entire genomes. Whole genome sequences of several monocot and dicot plants that are available in the public domain provide an opportunity to study the origin, distribution and evolution of microsatellites, and also facilitate the development of new molecular markers. In the present investigation, a genome-wide analysis of microsatellite distribution in monocots (Brachypodium, sorghum and rice) and dicots (Arabidopsis, Medicago and Populus) was performed. A total of 797,863 simple sequence repeats (SSRs) were identified in the whole genome sequences of six plant species. Characterization of these SSRs revealed that mono-nucleotide repeats were the most abundant repeats, and that the frequency of repeats decreased with increase in motif length both in monocots and dicots. However, the frequency of SSRs was higher in dicots than in monocots both for nuclear and chloroplast genomes. Interestingly, GC-rich repeats were the dominant repeats only in monocots, with the majority of them being present in the coding region. These coding GC-rich repeats were found to be involved in different biological processes, predominantly binding activities. In addition, a set of 22,879 SSR markers that were validated by e-PCR were developed and mapped on different chromosomes in Brachypodium for the first time, with a frequency of 101 SSR markers per Mb. Experimental validation of 55 markers showed successful amplification of 80% SSR markers in 16 Brachypodium accessions. An online database ‘BraMi’ (Brachypodium microsatellite markers) of these genome-wide SSR markers was developed and made available in the public domain. The observed differential patterns of SSR marker distribution would be useful for studying microsatellite evolution in a monocot–dicot system. SSR markers developed in this study would be helpful for genomic studies in Brachypodium and related grass species, especially for the map based cloning of the candidate gene(s)

    Identification of conserved gene clusters in multiple genomes based on synteny and homology

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Uncovering the relationship between the conserved chromosomal segments and the functional relatedness of elements within these segments is an important question in computational genomics. We build upon the series of works on <it>gene teams</it> and <it>homology teams.</it></p> <p>Results</p> <p>Our primary contribution is a local sliding-window SYNS (SYNtenic teamS) algorithm that refines an existing family structure into orthologous sub-families by analyzing the neighborhoods around the members of a given family with a locally sliding window. The neighborhood analysis is done by computing conserved gene clusters. We evaluate our algorithm on the existing homologous families from the Genolevures database over five genomes of the Hemyascomycete phylum.</p> <p>Conclusions</p> <p>The result is an efficient algorithm that works on multiple genomes, considers paralogous copies of genes and is able to uncover orthologous clusters even in distant genomes. Resulting orthologous clusters are comparable to those obtained by manual curation.</p

    Selection of a core set of RILs from Forrest × Williams 82 to develop a framework map in soybean

    Get PDF
    Soybean BAC-based physical maps provide a useful platform for gene and QTL map-based cloning, EST mapping, marker development, genome sequencing, and comparative genomic research. Soybean physical maps for “Forrest” and “Williams 82” representing the southern and northern US soybean germplasm base, respectively, have been constructed with different fingerprinting methods. These physical maps are complementary for coverage of gaps on the 20 soybean linkage groups. More than 5,000 genetic markers have been anchored onto the Williams 82 physical map, but only a limited number of markers have been anchored to the Forrest physical map. A mapping population of Forrest × Williams 82 made up of 1,025 F8 recombinant inbred lines (RILs) was used to construct a reference genetic map. A framework map with almost 1,000 genetic markers was constructed using a core set of these RILs. The core set of the population was evaluated with the theoretical population using equality, symmetry and representativeness tests. A high-resolution genetic map will allow integration and utilization of the physical maps to target QTL regions of interest, and to place a larger number of markers into a map in a more efficient way using a core set of RILs

    A pipeline for high throughput detection and mapping of SNPs from EST databases

    Get PDF
    Single nucleotide polymorphisms (SNPs) represent the most abundant type of genetic variation that can be used as molecular markers. The SNPs that are hidden in sequence databases can be unlocked using bioinformatic tools. For efficient application of these SNPs, the sequence set should be error-free as much as possible, targeting single loci and suitable for the SNP scoring platform of choice. We have developed a pipeline to effectively mine SNPs from public EST databases with or without quality information using QualitySNP software, select reliable SNP and prepare the loci for analysis on the Illumina GoldenGate genotyping platform. The applicability of the pipeline was demonstrated using publicly available potato EST data, genotyping individuals from two diploid mapping populations and subsequently mapping the SNP markers (putative genes) in both populations. Over 7000 reliable SNPs were identified that met the criteria for genotyping on the GoldenGate platform. Of the 384 SNPs on the SNP array approximately 12% dropped out. For the two potato mapping populations 165 and 185 SNPs segregating SNP loci could be mapped on the respective genetic maps, illustrating the effectiveness of our pipeline for SNP selection and validation

    Analysis of high-identity segmental duplications in the grapevine genome

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Segmental duplications (SDs) are blocks of genomic sequence of 1-200 kb that map to different loci in a genome and share a sequence identity > 90%. SDs show at the sequence level the same characteristics as other regions of the human genome: they contain both high-copy repeats and gene sequences. SDs play an important role in genome plasticity by creating new genes and modeling genome structure. Although data is plentiful for mammals, not much was known about the representation of SDs in plant genomes. In this regard, we performed a genome-wide analysis of high-identity SDs on the sequenced grapevine (<it>Vitis vinifera</it>) genome (PN40024).</p> <p>Results</p> <p>We demonstrate that recent SDs (> 94% identity and >= 10 kb in size) are a relevant component of the grapevine genome (85 Mb, 17% of the genome sequence). We detected mitochondrial and plastid DNA and genes (10% of gene annotation) in segmentally duplicated regions of the nuclear genome. In particular, the nine highest copy number genes have a copy in either or both organelle genomes. Further we showed that several duplicated genes take part in the biosynthesis of compounds involved in plant response to environmental stress.</p> <p>Conclusions</p> <p>These data show the great influence of SDs and organelle DNA transfers in modeling the <it>Vitis vinifera </it>nuclear DNA structure as well as the impact of SDs in contributing to the adaptive capacity of grapevine and the nutritional content of grape products through genome variation. This study represents a step forward in the full characterization of duplicated genes important for grapevine cultural needs and human health.</p

    A physical map of Brassica oleracea shows complexity of chromosomal changes following recursive paleopolyploidizations

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Evolution of the Brassica species has been recursively affected by polyploidy events, and comparison to their relative, <it>Arabidopsis thaliana</it>, provides means to explore their genomic complexity.</p> <p>Results</p> <p>A genome-wide physical map of a rapid-cycling strain of <it>B. oleracea </it>was constructed by integrating high-information-content fingerprinting (HICF) of Bacterial Artificial Chromosome (BAC) clones with hybridization to sequence-tagged probes. Using 2907 contigs of two or more BACs, we performed several lines of comparative genomic analysis. Interspecific DNA synteny is much better preserved in euchromatin than heterochromatin, showing the qualitative difference in evolution of these respective genomic domains. About 67% of contigs can be aligned to the Arabidopsis genome, with 96.5% corresponding to euchromatic regions, and 3.5% (shown to contain repetitive sequences) to pericentromeric regions. Overgo probe hybridization data showed that contigs aligned to Arabidopsis euchromatin contain ~80% of low-copy-number genes, while genes with high copy number are much more frequently associated with pericentromeric regions. We identified 39 interchromosomal breakpoints during the diversification of <it>B. oleracea </it>and <it>Arabidopsis thaliana</it>, a relatively high level of genomic change since their divergence. Comparison of the <it>B. oleracea </it>physical map with Arabidopsis and other available eudicot genomes showed appreciable 'shadowing' produced by more ancient polyploidies, resulting in a web of relatedness among contigs which increased genomic complexity.</p> <p>Conclusions</p> <p>A high-resolution genetically-anchored physical map sheds light on Brassica genome organization and advances positional cloning of specific genes, and may help to validate genome sequence assembly and alignment to chromosomes.</p> <p>All the physical mapping data is freely shared at a WebFPC site (<url>http://lulu.pgml.uga.edu/fpc/WebAGCoL/brassica/WebFPC/</url>; Temporarily password-protected: account: pgml; password: 123qwe123.</p
    corecore